Multilingual Datasets

نویسندگان

چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reassembling Multilingual Temporal News Datasets with Incomplete Information

Institutional investors are building increasingly more sophisticated algorithmic trading engines that account for textual as well as numerical information. To train these engines they need large datasets of information with highly accurate timestamps that cover long periods with differing trading conditions. Thus, the demand for temporal news datasets beyond the point where full archives are av...

متن کامل

A Multilingual Datasets Repository of the Hadith Content

Knowledge extraction from unstructured data is a challenging research problem in research domain of Natural Language Processing (NLP). It requires complex NLP tasks like entity extraction and Information Extraction (IE), but one of the most challenging tasks is to extract all the required entities of data in the form of structured format so that data analysis can be applied. Our focus is to exp...

متن کامل

Coping with highly imbalanced datasets: A case study with definition extraction in a multilingual setting

This paper addresses the task of automatic extraction of definitions by thoroughly exploring an approach that solely relies on machine learning techniques, and by focusing on the issue of the imbalance of relevant datasets. We obtained a breakthrough in terms of the automatic extraction of definitions, by extensively and systematically experimenting with different sampling techniques and their ...

متن کامل

Multilingual Pronunciat Improving Multilingual S

Multilinguality aspects are becoming increasingly important in the Automatic Speech Recognition (ASR) systems. It is apparent that coping with large variability of the speech signal is an even bigger challenge in multilingual ASR systems than it has been in conventional monolingual systems. In this paper, we address the importance of combining multilingual pronunciation modeling and acoustic mo...

متن کامل

Multilingual Grammar Resources in Multilingual Application Development

Grammar development makes up a large part of the multilingual rule-based application development cycle. One way to decrease the required grammar development efforts is to base the systems on multilingual grammar resources. This paper presents a detailed description of a parametrization mechanism used for building multilingual grammar rules. We show how these rules, which had originally been des...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The Stata Journal: Promoting communications on statistics and Stata

سال: 2005

ISSN: 1536-867X,1536-8734

DOI: 10.1177/1536867x0500500203